Predicting customer retention and profitability by using random forests and regression forests techniques
نویسندگان
چکیده
In an era of strong customer relationship management (CRM) emphasis, firms strive to build valuable relationships with their existing customer base. In this study we attempt to better understand three important measures of customer outcome: next buy, partial defection and customers’ profitability evolution. By means of random forests techniques we investigate a broad set of explanatory variables, including past customer behavior, observed customer heterogeneity and some typical variables related to intermediaries. We analyze a real-life sample of 100,000 customers taken from the data warehouse of a large European financial services company. Two types of random forests techniques are employed to analyze the data: random forests are used for binary classification, whereas regression forests are applied for the models with linear dependent variables. Our research findings demonstrate that both random forests techniques provide better fit for the estimation and validation sample compared to ordinary linear regression and logistic regression models. Furthermore, we find evidence that the same set of variables have a different impact on buying versus defection versus profitability behavior. Our findings suggest that past customer behavior is more important to generate repeat purchasing and favorable profitability evolutions, while the intermediary’s role has a greater impact on the customers’ defection proneness. Finally, our results demonstrate the benefits of analyzing different customer outcome variables simultaneously, since an extended investigation of the next buy partial defection customer profitability triad indicates that one cannot fully understand a particular outcome without understanding the other related behavioral outcome variables.
منابع مشابه
Customer churn prediction using improved balanced random forests
Churn prediction is becoming a major focus of banks in China who wish to retain customers by satisfying their needs under resource constraints. In churn prediction, an important yet challenging problem is the imbalance in the data distribution. In this paper, we propose a novel learning method, called improved balanced random forests (IBRF), and demonstrate its application to churn prediction. ...
متن کاملActivity– level as a link between customer retention and consumer lifetime value
Customer activity has received more attention due to the increase of social network applications. Moreover, customer activity could be an answer to the research debate about the significant relationship between retention rate and lifetime profitability of customers. Several researchers believe that an increase in the retention rate of customers may enhance their customer lifetime value (CLV), o...
متن کاملPredicting customer loyalty using the internal transactional database
Loyalty and targeting are central topics in Customer Relationship Management. Yet, the information that resides in customer databases only records transactions at a single company, whereby customer loyalty is generally unavailable. In this study, we enrich the customer database with a prediction of a customer's behavioral loyalty such that it can be deployed for targeted marketing actions witho...
متن کاملCustomer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting
Customer Relationship Management (CRM) enjoys increasing attention as a countermeasure to switching behaviour of customers. Because foregone profits of (partially) defected customers are significant, an increase of the retention rate can be very profitable. In this paper, we focus on the treatment of a company’s most promising customers in a non-contractual setting. We build a model in order to...
متن کاملAnalysis of Machine Learning Techniques for Heart Failure Readmissions.
BACKGROUND The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine learning algorithms for predicting readmissions. METHODS AND RESULTS Using data fro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Expert Syst. Appl.
دوره 29 شماره
صفحات -
تاریخ انتشار 2005